Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity: Extended Version
نویسندگان
چکیده
In the research area of multi-robot systems, several researchers have reported on consistent success in using heuristic measures to improve loose coordination in teams, by minimizing coordination costs using various heuristic techniques. While these heuristic methods has proven successful in several domains, they have never been formalized, nor have they been put in context of existing work on adaptation and learning. As a result, the conditions for their use remain unknown. We posit that in fact all of these different heuristic methods are instances of reinforcement learning in a one-stage MDP game, with the specific heuristic functions used as rewards. We show that a specific reward function—which we call Effectiveness Index (EI)—is an appropriate reward function for learning to select between coordination methods. EI estimates the resource-spending velocity by a coordination algorithm, and allows minimization of this velocity using familiar reinforcement learning algorithms (in our case, Q-learning in one-stage MDP). The paper analytically and empirically argues for the use of EI by proving that under certain conditions, maximizing this reward leads to greater utility in the task. We report on initial experiments that demonstrate that EI indeed overcomes limitations in previous work, and outperforms it in different cases.
منابع مشابه
Towards Adaptive Multi-Robot Coordination Based on Resource Expenditure Velocity
In the research area of multi-robot systems, several researchers have reported on consistent success in using heuristic measures to improve loose coordination in teams, by minimizing coordination costs using various heuristic techniques. While these heuristic methods has proven successful in several domains, they have never been formalized, nor have they been put in context of existing work on ...
متن کاملDesign of an Adaptive Fuzzy Estimator for Force/Position Tracking in Robot Manipulators
This paper presents a stable new algorithm for force/position control in robot manipulators. In this algorithm, position vectors are measured by sensors and then used in the control law. Since using force sensor has some issues such as high costs and technical problems, an approach is presented to overcome these issues. In this respect, force sensor is replaced by an adaptive fuzzy estimator to...
متن کاملTowards a Probabilistic Roadmap for Multi-robot Coordination
In this paper, we discuss the problem of multirobot coordination and propose an approach for coordinated multi-robot motion planning by using a probabilistic roadmap (PRM) based on adaptive cross sampling (ACS). The proposed approach, called ACS-PRM, is a samplingbased method and consists of three steps including Cspace sampling, roadmap building and motion planning. In contrast to previous app...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008